High frequency batch-oriented computations over large sliding time windows

نویسندگان

  • Leonardo Aniello
  • Leonardo Querzoni
  • Roberto Baldoni
چکیده

Today’s business workflows are very likely to include batch computations that periodically analyze subsets of data relative to specific time ranges in order to provide strategic information for stakeholders and other interested parties. The frequency of these computations directly impacts on how much updated such information can be, which provides an effective measure of their usefulness. This in turn drives towards solutions that allow to carry out batch elaborations more often, so as to always have updated information as soon as possible. How often they can be executed usually depends either on application-specific requirements or on some other constraints, typically about completion time because it’s uncommon to start a new batch computation before the previous one has completed, and the typical amounts of data to elaborate in these scenarios are so large that a computation can take very long. In this paper we propose a model for batch processing on sliding time window event computations that allows the definition of multiple metrics for performance optimization. These metrics specifically take into account the organization of input data to minimize its impact on computation latency. The model is then instantiated on Hadoop, a batch processing engine based on the MapReduce paradigm, and a set of strategies for efficiently arranging input data is described and evaluated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sliding Windows with Limited Storage

We consider time-space tradeoffs for exactly computing frequency moments and order statistics over sliding windows [16]. Given an input of length 2n− 1, the task is to output the function of each window of length n, giving n outputs in total. Computations over sliding windows are related to direct sum problems except that inputs to instances almost completely overlap. • We show an average case ...

متن کامل

Effective Computations on Sliding Windows

In the streaming model, elements arrive sequentially and can be observed only once. Maintaining statistics and aggregates is an important and nontrivial task in this model. These tasks become even more challenging in the sliding windows model, where statistics must be maintained only over the most recent n elements. In their pioneering paper, Datar et al. [SIAM J. Comput., 31 (2002), pp. 1794–1...

متن کامل

A Sliding Window Filter for SLAM

This note describes a Sliding Window Filter that is an on-line constanttime approximation to the feature-based 6-degree-of-freedom full Batch Least Squares Simultaneous Localization and Mapping (SLAM) problem. We contend that for SLAM to be useful in large environments and over extensive run-times, its computational time complexity must be constant, and its memory requirements should be at most...

متن کامل

Fast Human Detection Using Motion Detection and Histogram of Oriented Gradients

This paper presents a real-time Human detection algorithm based on HOG (Histograms of Oriented Gradients) features and SVM (Support Vector Machine) architecture. Motion detection is used to extract moving regions, which can be scanned by sliding windows; detecting moving region can subtract unnecessary sliding windows of static background regions under the surveillance conditions, then detectio...

متن کامل

Framework for real-time clustering over sliding windows

Clustering queries over sliding windows require maintaining cluster memberships that change as windows slide. To address this, the Generic 2-phase Continuous Summarization framework (G2CS) utilizes a generation based window maintenance approach where windows are maintained over different time intervals. It provides algorithm independent and efficient sliding mechanisms for clustering queries wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Future Generation Comp. Syst.

دوره 43-44  شماره 

صفحات  -

تاریخ انتشار 2015